Formant estimation based on temporal synchronous analysis
Authors
Abstract
The accuracy of formant frequency estimation on voiced speech in frame-based linear predictive analysis is affected by the position of the analysis frame relative to the instant of onset of vocal tract excitation. An automatic waveform-dependent point-wise analysis which employs a weighted least-square lattice (WLSL) algorithm to minimise these errors is described here. Experiments on both synthetic speech and real speech are included to show that the algorithm offers improved accuracy in comparison to the frame-based method.
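The frame-position sensitivity described above can be explored with a minimal sketch of the conventional frame-based baseline. It is not the WLSL algorithm, which the abstract does not specify. The sketch assumes the autocorrelation method of linear prediction with a Hamming window, a synthetic vowel built from an impulse train and three resonators, and illustrative values (8 kHz sampling, 100 Hz pitch, formants at 500, 1500 and 2500 Hz). All function names and parameters are hypothetical.

```python
import numpy as np

def vocal_tract_poly(formants_hz, bandwidths_hz, fs):
    """All-pole denominator built from one two-pole resonator per formant."""
    a = np.array([1.0])
    for f, bw in zip(formants_hz, bandwidths_hz):
        r = np.exp(-np.pi * bw / fs)       # pole radius from bandwidth
        theta = 2.0 * np.pi * f / fs       # pole angle from centre frequency
        a = np.convolve(a, [1.0, -2.0 * r * np.cos(theta), r * r])
    return a

def synth_vowel(a, fs, f0, n_samples):
    """Impulse-train excitation passed through the all-pole filter 1/A(z)."""
    period = int(round(fs / f0))
    x = np.zeros(n_samples)
    x[::period] = 1.0                      # excitation onsets every pitch period
    y = np.zeros(n_samples)
    p = len(a) - 1
    for n in range(n_samples):
        acc = x[n]
        for k in range(1, min(n, p) + 1):
            acc -= a[k] * y[n - k]
        y[n] = acc
    return y

def lpc_autocorr(frame, order):
    """Frame-based LP coefficients (autocorrelation method, Hamming window)."""
    w = frame * np.hamming(len(frame))
    full = np.correlate(w, w, mode="full")
    r = full[len(w) - 1:len(w) + order]    # autocorrelation lags 0..order
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    coeffs = np.linalg.solve(r[idx], -r[1:order + 1])  # Toeplitz normal equations
    return np.concatenate(([1.0], coeffs))

def formants_from_lpc(a, fs):
    """Formant frequencies in Hz from the upper-half-plane roots of A(z)."""
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]
    return np.sort(np.angle(roots) * fs / (2.0 * np.pi))

if __name__ == "__main__":
    fs, f0, order, frame_len = 8000, 100.0, 6, 240   # assumed illustrative values
    a_true = vocal_tract_poly([500.0, 1500.0, 2500.0], [60.0, 90.0, 120.0], fs)
    y = synth_vowel(a_true, fs, f0, 1600)
    period = int(round(fs / f0))
    # Slide the frame start across one pitch period and compare the estimates.
    for offset in range(0, period, 10):
        start = 800 + offset
        a_est = lpc_autocorr(y[start:start + frame_len], order)
        print(offset, np.round(formants_from_lpc(a_est, fs), 1))
```

Running the script prints one set of formant estimates per frame offset. Any variation across offsets reflects the dependence on frame position relative to the excitation onset that a point-wise analysis is meant to reduce.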
Similar resources
Formant Estimation and Tracking Using Deep Learning
Formant frequency estimation and tracking are among the most fundamental problems in speech processing. In the former task the input is a stationary speech segment such as the middle part of a vowel and the goal is to estimate the formant frequencies, whereas in the latter task the input is a series of speech frames and the goal is to track the trajectory of the formant frequencies throughout t...
On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework ...
An Effective Attack-Resilient Kalman Filter-Based Approach for Dynamic State Estimation of Synchronous Machine
Kalman filtering has been widely considered for dynamic state estimation in smart grids. Despite its unique merits, the Kalman Filter (KF)-based dynamic state estimation can be undesirably influenced by cyber adversarial attacks that can potentially be launched against the communication links in the Cyber-Physical System (CPS). To enhance the security of KF-based state estimation, in this paper...
Psychoacoustical evaluation of the pitch-synchronous overlap-and-add speech-waveform manipulation technique using single-formant stimuli.
This article presents two experiments dealing with a psychoacoustical evaluation of the pitch-synchronous overlap-and-add (PSOLA) technique. This technique has been developed for modification of duration and fundamental frequency of speech and is based on simple waveform manipulations. Both experiments were aimed at deriving the sensitivity of the auditory system to the basic distortions introd...
Statistical properties of linear prediction analysis underlying the challenge of formant bandwidth estimation.
Formant bandwidth estimation is often observed to be more challenging than the estimation of formant center frequencies due to the presence of multiple glottal pulses within a period and short closed-phase durations. This study explores inherently different statistical properties between linear prediction (LP)-based estimates of formant frequencies and their corresponding bandwidths that may be...